Understanding Memory-Regret Trade-Off for Streaming Stochastic Multi-Armed Bandits

arXiv.org Machine Learning

We study the stochastic multi-armed bandit problem in the $P$-pass streaming model. In this problem, the $n$ arms are present in a stream and at most $m